K-mean Based Clustering and Context Quantization
نویسندگان
چکیده
In this thesis, we study the problems of K-means clustering and context quantization. The main task of K-means clustering is to partition the training patterns into k distinct groups or clusters that minimize the mean-square-error (MSE) objective function. But the main difficulty of conventional K-means clustering is that its classification performance is highly susceptible to the initialized solution or codebook. Hence the main goal of this research work is to investigate the effective K-means clustering algorithms to overcome this difficulty. An extensive task addressed by this thesis is to design a feasible context quantizer in circumventing the so-called context dilution problem, which is a specific form of K-means clustering problem. Publication P1 presents a genetic algorithm to tackle the k-center clustering problem by using randomized swapping to change one reference vector between the parent solutions in the crossover and then using a local repartition clustering procedure. The algorithm estimates the number of clusters automatically while optimizing the location of the clusters. It has been shown that the algorithm outperforms the other Kmeans algorithms reviewed in the publication.
منابع مشابه
NGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self- Organizing Map
Identifying clusters is an important aspect of data analysis. This paper proposes a noveldata clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizingmap (NGTSOM ) and neural gas (NG) are used in combination with Competitive Hebbian Learning(CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clusteringdata. Different ...
متن کاملFuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition
In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...
متن کاملPrevalent Color Extraction and Indexing
Colors in an image provides tremendous amount of information. Using this color information images can be segmented, analyzed, labeled and indexed. In content based image retrieval system, color is one of the basic primitive features used. In Prevalent Color Extraction and indexing, the most extensive color on an image is identified and it is used for indexing. For implementation, Asteroideae fl...
متن کاملA Novel Approach Towards K-Mean Clustering Algorithm With PSO
In this paper, the proposed approach is an unique combination of two most popular clustering algorithms Particle Swarm Optimization (PSO) and K-Means to achieve better clustering result. Clustering is a technique of grouping homogeneous objects of a dataset with aim to extract some meaningful pattern or information. K-Means algorithm is the most popular clustering algorithm because of its easy ...
متن کاملA Robust Color Image Quantization Algorithm Based on Knowledge Reuse of K-MeansClustering Ensemble
This paper presents a novel color image quantization algorithm. This algorithm improves color image quantization stability and accuracy using clustering ensemble. In our approach, we firstly adopt manifold single k-means clusterings for the color image to form a preliminary ensemble committee. Then, in order to avoid inexplicit correspondence among clustering groups, we use the original color v...
متن کامل